Fast k-clustering Queries on Road Networks
نویسندگان
چکیده
In this article, we study the k-clustering query problem on road networks, an important problem in Geographic Information Systems. Using Euclidean embeddings and reduction to fast nearest neighbor search, we devise approximation algorithms for these problems. Since these problems are difficult to solve exactly – and even hard to approximate for most variants – we compare our constant factor approximation algorithms to exact answers on small synthetic datasets and on a dataset representing Tallahassee, Florida, a small city. We have implemented a web application that demonstrates our method for road networks in the same small city. Keywords-k-clustering, k-means, k-medians, k-centers, embeddings, Computational Geometry, GIS.
منابع مشابه
Top-k Spatial Preference Queries in Directed Road Networks
Top-k spatial preference queries rank objects based on the score of feature objects in their spatial neighborhood. Top-k preference queries are crucial for a wide range of location based services such as hotel browsing and apartment searching. In recent years, a lot of research has been conducted on processing of top-k spatial preference queries in Euclidean space. While few algorithms study to...
متن کاملeSPAK: Top-K Spatial Keyword Query Processing in Directed Road Networks
Given a query location and a set of query keywords, a top-k spatial keyword query rank objects based on the distance to the query location and textual relevance to the query keywords. Several solutions have been proposed for top-k spatial keyword queries in Euclidean space. However, few algorithms study top-k keyword queries in undirected road networks where every road segment is undirected. Ev...
متن کاملModification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis
Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...
متن کاملA link-based storage scheme for efficient aggregate query processing on clustered road networks
The need to have efficient storage schemes for spatial networks is apparent when the volume of query processing in some road networks (e.g., the navigation systems) is considered. Specifically, under the assumption that the road network is stored in a central server, the adjacent data elements in the network must be clustered on the disk processing of network queries. In this work, we introduce...
متن کاملStreaming Algorithms for k-Means Clustering with Fast Queries
We present methods for k-means clustering on a stream with a focus on providing fast responses to clustering queries. When compared with the current state-of-the-art, our methods provide a substantial improvement in the time to answer a query for cluster centers, while retaining the desirable properties of provably small approximation error, and low space usage. Our algorithms are based on a no...
متن کامل